Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9)
    id PAA17065 for urn-ietf-out; Wed, 6 Nov 1996 15:52:05 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1])
    by services.bunyip.com (8.6.10/8.6.9) with SMTP id PAA17059
    for <urn-ietf@services.bunyip.com>; Wed, 6 Nov 1996 15:51:56 -0500
Received: from IG.CS.UTK.EDU by mocha.bunyip.com with SMTP
    (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA13528
    (mail destined for urn-ietf@services.bunyip.com); Wed, 6 Nov 96 15:51:38 -0500
Received: from localhost by ig.cs.utk.edu with SMTP (cf v2.11c-UTK)
    id PAA10773; Wed, 6 Nov 1996 15:32:28 -0500 (EST)
Message-Id: <199611062032.PAA10773@ig.cs.utk.edu>
X-Uri: http://www.cs.utk.edu/~moore/
From: Keith Moore <moore@cs.utk.edu>
To: Martin J Duerst <mduerst@ifi.unizh.ch>
Cc: moore@cs.utk.edu (Keith Moore), tallen@fsc.fujitsu.com, urn-ietf@bunyip.com
Subject: Re: [URN] I18N does not belong in URNs
In-Reply-To: Your message of "Wed, 06 Nov 1996 18:02:19 +0100."
    <"josef.ifi..172:06.10.96.17.02.21"@ifi.unizh.ch>
Date: Wed, 06 Nov 1996 15:32:28 -0500
Sender: owner-urn-ietf@services.bunyip.com
Precedence: bulk
Reply-To: Keith Moore <moore@cs.utk.edu>
Errors-To: owner-urn-ietf@bunyip.com

> >> >> Your
> >> >> last point would have merit were it not that we absolutely must
> >> >> be able to grandfather in existing name spaces.
> >> >
> >> >Yes, we must be able to grandfather existing name spaces *that serve
> >> >similar purposes to URNs*. ISBNs qualify. Handles qualify.
> >> >Usenet message-ids qualify. But URNs don't have to handle existing
> >> >name spaces that don't have the characteristics that URNs have.
>
> >> And the fact that you
> >> don't know about any such namespaces doesn't guarantee that there
> >> are no such namespaces.
>
> I thought I didn't know any such namespaces when I wrote this mail.
> But I have become aware that my postal savings account number in
> Japan contains a kanji.
> The set of kanji in this case is limited
> to one each for each of the central offices that keeps track of
> postal savings accounts. I am very sure also that there are lots
> of other examples that fit the requirements of URNs at least as
> well as ISBNs that contain kanji or other non-ASCII characters.

Perhaps. I don't have as big a problem with this, as long as there's
a discipline about how they are assigned that prevents URNs from
being used as search strings. That's my main worry. (URNs are also
supposed to be transcribable, but I recognize that only a very small
set of characters is transcribable by everyone on the planet.)

Sorry, I can't make heads or tails of this.

> So for grandfathering, we have two choices.
>
> 1) We interpret it as "direct grandfathering of characters",
>    in which case we have to allow a really wide range
>    of characters (i.e. ISO 10646).
> 2) We interpret it as "indirect grandfathering", in which
>    case the digits, or whatever small set, will be
>    enough.

I suspect there will be some of each. But we want the transformations
from "old identifier" to URN to be simple and easy to remember.
Ideally, you just prepend a URN:namespace: string. In some cases it
may be necessary to remove punctuation; for instance, we might want
to remove '-' characters from ISBNs. But that gets specified on a
per-namespace basis.

> Choosing ASCII only as for URLs would be very unfair cheating.

If the names aren't human-meaningful, I don't see what you're
complaining about.

> >> I definitely hate to see i18n just being moved around and delayed
> >> by some people. With UTF-8 in URNs, we have made great progress.
> >
> >No, this is a big step backward. Just because there is a new layer
> >being defined doesn't mean that I18N belongs there.
> >
> >I18N *is* important -- and I'd be happy to see a draft document or
> >draft charter for a working group to define a protocol (say an extension
> >of http/html) to resolve human-friendly names into URNs or URLs.
>
> Why is there a need for an additional protocol layer?
> There is no need currently for English, is it?

Yes, there is. Human-friendly names are inherently ambiguous, with
multiple meanings that require a human being to untangle.
(If I ask for "Julius Caesar", do I mean the Shakespeare play or
the person or what? Most people would think the name is
unambiguous -- after all, they already know what they have in mind
and the other meanings will not occur to them. But if you feed
"Julius Caesar" to a library catalog, you will get back large
numbers of hits. By contrast, if you feed )

URNs should be precise enough that a human isn't required to
disambiguate the result.

> And I don't want to have to search through long lists
> of possible answers just because I use Japanese, whereas
> I will get one immediate answer for English.

The point is that you will often get multiple answers for English.
English names are no more precise than names in other languages.

> >> Some protocols (notably FTP) are also going in the direction of
> >> UTF-8. Ideally, we will have a uniform solution for both URLs
> >> and URNs, based on UTF-8.
> >
> >No. Ideally, if we figure out how to do this, we will have a
> >uniform solution for all URLs. I18N (and even E8H :) absolutely
> >doesn't belong in URNs.
>
> Well, if URLs are uniformly internationalized, I wouldn't
> mind about URNs. But I know that there are people, and
> this probably includes you, that claim that URLs also
> are not designed to be human-friendly.

They weren't *designed* to be human-friendly, and they don't do
a very good job of it (if nothing else, they're hard to type and
you can't easily say them out loud). But URLs tend to reflect
their underlying file and directory structures, which *are*
(sort of) human-friendly. (Really, they're more author-friendly.)

> >For URNs, human-friendliness will be excluded by design.
>
> I18N and human-friendliness are not the same thing.
> For direct grandfathering, I18N is needed.
> If you are
> sure human-friendliness will not sneak into URNs
> with ASCII, it will also not sneak into URNs with
> UTF-8.

I think URNs need restrictions on human-friendliness in either case.

Keith
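
The "old identifier to URN" transformation mentioned in the message (prepend a URN:namespace: string, and strip punctuation such as '-' on a per-namespace basis) might be sketched roughly as follows. This is only an illustration, not anything the list agreed on: the function name to_urn, the namespace label "isbn", the strip parameter, and the sample ISBN are all assumptions introduced here.

```python
def to_urn(namespace: str, identifier: str, strip: str = "") -> str:
    """Turn an existing identifier into a URN-style string.

    namespace  -- hypothetical namespace label, e.g. "isbn"
    identifier -- the identifier as written in its own name space
    strip      -- punctuation to remove first; chosen per namespace
    """
    # Remove each character the namespace declares to be insignificant.
    for ch in strip:
        identifier = identifier.replace(ch, "")
    # Otherwise the mapping is a plain prefix, as the message suggests.
    return f"URN:{namespace}:{identifier}"


# An ISBN with its '-' characters removed (sample value, for illustration).
print(to_urn("isbn", "0-13-110362-8", strip="-"))  # URN:isbn:0131103628
```

Keeping the stripped characters as a per-namespace argument reflects the point that such rules "get specified on a per-namespace basis" rather than being fixed for all URNs.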